1National Key Laboratory of Multispectral Information Intelligent Processing Technology, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, 430074, China
2PhenoTrait Technology Co., Ltd., Beijing, 100096, China
3State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
4MetaPheno Laboratory, Shanghai, 201114, China
5SpeCloud Technology Co., Ltd., Sanya, 572025, China
6These authors contributed equally to this work. Work done during internship at PhenoTrait Technology Co., Ltd.
7In this paper, we slightly abuse the term ‘crop’: we do not discriminate between crops and weeds, and instead consider general greenness extraction.
Received 04 Sep 2024 | Accepted 10 Dec 2024 | Published 27 Feb 2025
We present Depth-Informed Crop Segmentation (DepthCropSeg), an almost unsupervised crop segmentation approach that requires no manual pixel-level annotations. Crop segmentation is a fundamental vision task in agriculture that benefits many downstream applications, such as crop growth monitoring and yield estimation. Over the past decade, image-based crop segmentation has shifted from classic color-based paradigms to deep learning-based ones. The latter, however, rely heavily on large amounts of data with high-quality manual annotation, demanding considerable human labor and time. In this work, we leverage Depth Anything V2, a vision foundation model, to produce high-quality pseudo crop masks for training segmentation models. We compile a dataset of 17,199 images from six public plant segmentation sources and generate pseudo masks by normalizing and thresholding the predicted depth maps. After a coarse-to-fine manual screening, 1,378 images with reliable masks are selected. We compare four semantic segmentation models and enhance the top-performing one with depth-informed two-stage self-training and depth-informed post-processing. To evaluate the feasibility and robustness of DepthCropSeg, we benchmark segmentation performance on 10 public crop segmentation testing sets and a self-collected dataset covering in-field, laboratory, and unmanned aerial vehicle (UAV) scenarios. Experimental results show that DepthCropSeg achieves crop segmentation performance comparable to a fully supervised model trained with manually annotated data (86.91 vs. 87.10). To our knowledge, this is the first successful demonstration of almost unsupervised crop segmentation approaching fully supervised performance.
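The pseudo-mask generation step described above (normalize the depth map, then threshold it into a binary crop mask) can be sketched as follows. This is a minimal illustration, not the paper's exact procedure: the function name, the min-max normalization, and the 0.5 cut-off are illustrative assumptions, and a toy array stands in for a Depth Anything V2 prediction.

```python
import numpy as np

def depth_to_pseudo_mask(depth: np.ndarray, threshold: float = 0.5) -> np.ndarray:
    """Min-max normalize a relative depth map to [0, 1], then threshold it
    into a binary foreground (crop) pseudo mask.

    NOTE: the 0.5 threshold is an illustrative choice, not the paper's
    actual parameter.
    """
    d_min, d_max = depth.min(), depth.max()
    norm = (depth - d_min) / (d_max - d_min + 1e-8)  # scale to [0, 1]
    return (norm > threshold).astype(np.uint8)       # 1 = crop, 0 = background

# Toy example: plant pixels closer to the camera get larger relative depth.
depth = np.array([[0.9, 0.8, 0.1],
                  [0.7, 0.2, 0.1],
                  [0.1, 0.1, 0.1]])
mask = depth_to_pseudo_mask(depth)
```

In practice the input would be the relative depth map predicted by the foundation model for each image, and the resulting masks would then go through the manual screening described above.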